The Astronomy Commons Platform: A Deployable Cloud-based Analysis Platform for Astronomy

نویسندگان

چکیده

Abstract We present a scalable, cloud-based science platform solution designed to enable next-to-the-data analyses of terabyte-scale astronomical tabular data sets. The presented is built on Amazon Web Services (over Kubernetes and S3 abstraction layers), utilizes Apache Spark the Astronomy eXtensions for parallel analysis manipulation, provides familiar JupyterHub web-accessible front end user access. outline architecture platform, provide implementation details rationale (and against) technology choices, verify scalability through strong weak scaling tests, demonstrate usability an example from Zwicky Transient Facility’s 1Bn+ light-curve catalog. Furthermore, we show how this system enables iteratively build (in Python) that transparently scale processing with no need end-user interaction. be deployable by astronomers moderate cloud engineering knowledge, or (ideally) IT groups. Over past 3 yr, it has been utilized platforms DiRAC Institute, ZTF partnership, LSST Solar System Science Collaboration, Interdisciplinary Network Collaboration Computing, as well numerous short-term events (with over 100 simultaneous users). In live demo instance, deployment scripts, source code, cost calculators are accessible. 4 http://hub.astronomycommons.org/

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Modern Cloud-Based Platform

ing and Library Use: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy for private use of patrons, provided the per-copy fee indicated in the code at the bottom of the rst page is paid through the Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923. NEXT ISSUE:

متن کامل

ASTROMLSKIT: A New Statistical Machine Learning Toolkit: A Platform for Data Analytics in Astronomy

Astroinformatics is a new impact area in the world of astronomy, occasionally called the final frontier, where several astrophysicists, statisticians and computer scientists work together to tackle various data intensive astronomical problems. Exponential growth in the data volume and increased complexity of the data augments difficult questions to the existing challenges. Classical problems in...

متن کامل

Sage Bionetworks Commons and Platform

Perspective: A Need for Better Maps of Disease The revolution in health care that was anticipated by the sequencing of the human genome has failed to materialize 1. The failure rate for drugs in clinical development is still startlingly high despite unprecedented investment in R&D that reached a record $65 billion in 2009. This is largely due to the very high attrition rate for compounds in cli...

متن کامل

Biologic Data Analysis Platform Based on the Cloud

To improve the research productivity in bioinformatics study by using effective means of large scale data analysis, there are many obstacles that need to be overcome They are standardization of data collection and analysis, management of computing and storage resources, easiness of parallel programming, and efficiency of data analysis job execution, to name a few. Among these, easiness of paral...

متن کامل

Astronomy in the Cloud: Using MapReduce for Image Coaddition

In the coming decade, astronomical surveys of the sky will generate tens of terabytes of images and detect hundreds of millions of sources every night. The study of these sources will involve computation challenges such as anomaly detection and classification and moving-object tracking. Since such studies benefit from the highest-quality data, methods such as image co-addition, i.e., astrometri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Astronomical Journal

سال: 2022

ISSN: ['1538-3881', '0004-6256']

DOI: https://doi.org/10.3847/1538-3881/ac77fb